Model Selection

# Image embedding extraction

Vit Small Patch14 Reg4 Dinov2.lvd142m

A Vision Transformer (ViT) image feature model with registers, pre-trained with the self-supervised DINOv2 method on the LVD-142M dataset.

Image Classification

Vit Large Patch16 224 In21k

A Vision Transformer model pretrained on the ImageNet-21k dataset, suitable for image feature extraction and downstream task fine-tuning.

Image Classification
